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Abstract 

Self gravitating systems (SGS) in the Universe are generally thought 
to be non-extensive, and often show long-tails in various distribution func- 
tions. In principle, these non-Boltzmann properties are naturally expected 
from the peculiar property of gravity, long-range and unshielded. There- 
fore the ordinary Boltzmann statistical mechanics would not be applica- 
ble for these self gravitating systems (SGS) in its naive form. In order 
to step further, we quantitatively investigate the above two properties, 
non-extensivity and long-tails, by explicitly introducing various models of 
statistical mechanics. We use the data of CfA II South redshift survey 
and apply the count-in-cell method. We study four statistical mechanics, 
(1) Boltzmann, (2) Fractal, (3) Renyi, and (4) Tsallis, and use Akaike 
information criteria (AIC) for the fair comparison. 

1 Introduction 

The Long-range unshielded force gravity forms unique structures in the Universe 
called self-gravitating systems (SGS). Almost all the relevant structures such as 
stars, galaxies, clusters and super-clusters of galaxies, belong to this category. 

These SGS generally (a) show non-extensive properties, and (b) have no 
absolute equilibrium state] the systems never stop their evolution toward singu- 
larity. 

The former property (a) is apparent in principle. Actually, the mass density 
p of iso-thermal SGS of size r is given by 

M <v 2 > _ 2 , , 

9 4tit 3 /3 4nGr 2 /3 T 1 ' 

where we used the virial theorem: K=—V/2, where K and V are respectively the 
kinetic energy and the potential energy of the system. Since the general exten- 
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sive structures have definite basic elements and therefore have constant density, 
the above scale-dependent density clearly shows the non-extensivity of SGS. 
This property reflects the long distance singularity of gravity. In other words, 
SGS cannot be divided into uniform independent ingredients (non- additive) in 
general. 

The latter property (b) is also apparent in principle. Actually, the partition 
function Z of SGS in the temperature T{— again using the virial theorem, 

is given by 



which has an essential singularity at the short distance limit. Therefore the ab- 
solute equilibrium state does not exist. This property reflects the short distance 
singularity of gravity. 

Since the ordinary Boltzmann statistical mechanics is based on the exten- 
sivity and the existence of the absolute equilibrium state, the above properties 
of gravity make it inapplicable in principle. 

Despite the property (b), SGS frequently show gwasi-equilibrium states which 
are often characterized by scaling properties; especially (c) the long-tail in var- 
ious distribution functions are marvelous. This fact may suggest the existence 
of any underling fundamental statistical mechanics for SGS. 

There are many examples of the above properties (a) and (c) in SGS: The 
actual mean mass density of a structure is proportional to r~ 1J (de Vaucouleurs 
197O)(Eq.0J is too simplified). Moreover in the extreme gravitating case of 
Black Hole, the 'entropy' is not proportional to the mass M but to the area 
M 2 . The property (c) is based on the absence of the characteristic scale in the 
Hamiltonian of SGS. The well known Holtzmark distribution of force F acting 
on a single star in a uniform star cluster is exactly the stable Levy distribution 
with index 3/2 and behaves as F~ 5 / 2 in large force limit (Binney et al. 1987). 
The correlation function of galaxies and the clusters show power law behavior 
£ oc r~ 186 (Borner 2002). 

As explained in the above, the ordinary Boltzmann statistical mechanics 
cannot be applied to SGS in the naive form, and therefore any generalization 
or new statistical mechanics would be necessary in order to explain the quasi- 
cquilibrium states of SGS. 

The necessary next step will be as follows, on which we will concentrate in 
this paper: 

(A) To choose astronomical data which reflect the above properties (a), (b), 
and (c) in the fundamental level. The data should be investigated not 
simply by the apparent form of the distribution functions but by internal 
structure of the theory that distinguish various statistical mechanics. 
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(B) To introduce a fair criterion which can specify the correct statistical me- 
chanics among various proposed theories which have different number of 
free parameters. 

For the purpose (A), we focus on the data of CfA II South (Huchra J., et 
al., 1999) galaxy distribution survey l . Especially, we use count-in-cell method, 
in which the probability f v (N) of finding a fixed number of galaxies in a fixed 
volume plays a central role. For the purpose (B), we focus on the Akaike Infor- 
mation Criteria (AIC) (Akaike 1973) 2 . This method will be most appropriate 
to compare different theories with different number of free fitting parameters. 

We have chosen the following four theories/models of statistical mechan- 
ics, classified according to the properties (a) non-extensivity and (c) long-tails. 
(Table 1.) 

(1) The Boltzmann statistical mechanics which is of course extensive from 
its construction. The associated Boltzmann distribution function does not 
possess long tail. Since this theory alone never explain the CfAII data (see 
below), we introduce an extra parameter b as in the reference by Saslaw 
W. C. & Hamilton A. J. S. (1984). This parameter measures the possible 
deviation from the complete virial equilibrium. 

(2) The (mono-)fractal-space model as a simple example of non-extensive 
theory with short tail distribution. 

(3) The Renyi statistical mechanics as an appropriate example of extensive 
statistical mechanics with long-tail distribution. 

(4) Thc Tsallis statistical mechanics (Nakamichi A., Joichi I., Iguchi O., 
Morikawa M., 2001), as a typical non-extensive theory. This has the same 
long-tail distribution as Renyi. It should be noted that we use proper 
Tsallis statistical mechanics with Escote averaging, which should be dis- 
tinguished from the older version of the normal averaging. Both Tsallis 
and Renyi statistical mechanics posses an extra parameter q which mea- 
sures the deviation from the ordinary Boltzmann statistical mechanics; 
they reduce to the latter in the limit q — ► 1 . 

Individual models/theories arc explained in detail below. 

We do not claim that this is the best available data, though it is uniform and large to 
some extent. 

2 We do not claim that this is the best available method; there are many extensions and gen- 
eralizations of AIC. These are unnecessarily complicated for the present purpose and therefore 
we choose the simplest version of AIC. 
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Table 1: Four models/theories of statistical mechanics classified by (non)- 
extensivity of the model and the long/short tails of the associated distribution 
function. 

2 Various models/theories of statistical mechan- 
ics 

2.1 Boltzmann statistical mechanics 

We consider the galaxy distribution which is supposed to obey the Boltzmann 
statistical mechanics with grand canonical ensemble. In the ordinary Boltzmann 
statistical mechanics, the tail of the distribution function exponentially reduces. 
We phenomcnologically generalize the theory and introduce the virial parameter 
b, which measures the deviation from the complete dynamical-equilibrium: 

pV = NT(1 - b), (3) 

where p denotes the pressure, V the volume of the system, N the number of 
galaxies contained in this volume, and T the temperature defined by the velocity 
dispersion. 

Usually, collision-less SGS attain the dynamical equilibrium before the ther- 
mal equilibrium: r therma] w ^T dynamical > r dynamical (see J. Binney et al. 1987 
for detail) . Therefore the parameter b phenomenologically represents some class 
of quasi-equilibrium . 

The distribution function /(0) is defined to be the probability of finding no 
galaxy in any part of the volume V. The explicit form is given by 

/ (0) = e _ ^ (1_fc) = e -™(l-6)47rr 3 /3 

where n is the galaxy number density and N = nV (Saslaw 1984). This is the 
generating functional of the general function f(N), which is defined to be the 
probability to find the N galaxies in the fixed volume V. The general expression 
of f(N) is given in the Appendix. 

From the observational data of CfA II South galaxy redshift survey, we calcu- 
late the probability f(N) using the count-in-cell method. The best fit parameter 



4 




200 400 600 800 100012001400 



Figure 1: The void probability /(0) in the Boltzmann statistical mechanics. 
The horizontal axis is the distance scale in the unit of rcdshift [km/sec]. The 
diamonds with error bars are CfA II South Observations. Note that the number 
of the actual data points is about ten times larger than that shown in this figure 
for better visibility. The void probability /(0) calculated from the Boltzmann 
statistical mechanics is plotted by the broken line with the best fit parameter 
b = 0.523. There exist quite a large deviation between the theory and the 
observation. 



b is calculated to minimize the Akaike Information Criteria (AIC) for /(0). The 
comparison of the theory and observation is shown in Fig. 1. The diamonds 
with error bars are CfA II South Observations. The void probability /(0) cal- 
culated from the Boltzmann statistical mechanics is plotted by the broken line 
with the best fit parameter b = 0.523. Quite a large deviation between the the- 
ory and the observation implies the inapplicability of the Boltzmann statistical 
mechanics even in the modified version. 

We fix the value of the parameter and with this value, general probabilities 
f(N) are calculated and are compared with the CfA II South data in Fig. 2. 

2.2 Boltzmann statistical mechanics with fractal matter 
distribution 

Various observational data suggest the idea that the matter distribution in the 
Universe is (multi-)fractal at least in some limited scale region (Kurokawa et 
al. 1999; 2000). In this section, we investigate a simple fractal model with 
the ordinary Boltzmann statistical mechanics. In this model, the system is 
non-extensive in the sense that the simple addition of the two identical frac- 
tal distribution in the three dimensional space generally destroys the scaling 
property of the original fractal system. On the other hand, the tail of the distri- 
bution function exponentially reduces since the model is based on the ordinary 
Boltzmann statistical mechanics. 

We consider a mono-fractal matter distribution with the fractal dimension 
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Figure 2: The probability functions /(l) — /(4) in the Boltzmann statistical 
mechanics. The horizontal axes are the distance scale in the unit of rcdshift 
[km/sec]. The diamonds with error bars are CfA II South Observations. The- 
oretical predictions using Boltzmann statistical mechanics with the parameter 
b = 0.523 arc shown by the broken lines. Note that the number of the actual 
data points is about ten times larger than that shown in this figure for better 
visibility. 



a, and suppose the system obeys Boltzmann statistical mechanics with grand 
canonical ensemble, as in the previous section. 

The void probability is simply given by / (0) = e~ N = er nV , but the 
number N non-trivially depends on the scale r in our fractal model. Thus /(0) 
is given by 



/(0) 



-N 



cxp 



*/2 7 



r(l + a/2) 



(5) 



This expression is shown in Fig. 3 with the observational data of CfAII redshift 
survey. By using the same method as in the previous section, we found the best 
fit value for the parameter a = 2.8821. 

With this best fit parameter value, general probabilities f(N) are calculated 
using the method in the Appendix, and they are compared with the CfA II 
South data in Fig. 4. 



2.3 Renyi statistical mechanics 

In this section, we study Renyi statistical mechanics, which is extensive but the 
associated distribution function has long-tail. Renyi statistical mechanics is a 
generalization of the ordinary Boltzmann theory by introducing a new form of 
entropy called Renyi entropy. This entropy is very similar to the information 
measure often used in multi- fractal models. However the use of this entropy as 
the starting point of the solid statistical mechanics is not clear. Therefore we 
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Figure 3: The same as Figure 1. but in the fractal model based on the Boltz- 
mann statistical mechanics. The theoretical prediction with the best fit param- 
eter a — 2.8821 is shown by the broken line. There exist slight disagreement 
between the theoretical model and the observational data. 
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Figure 4: Same as Figure 2. but in the fractal model with the Boltzmann 
statistical mechanics. The parameter of fractal dimension is fixed as a — 2.8821. 
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concentrate on the application of this statistical mechanics in this paper without 
discussion of its physical foundation. 
The Renyi entropy is defined by 

S[ P ] = (6) 

where the parameter q measures the deviation from the ordinary Boltzmann 
entropy. Actually in the limitg — > 1 , this reduces to the Boltzmann entropy. 

When we compose two independent systems A and B, the distribution func- 
tion is given by pij[A, B] = pi[A]pj[B], and the composed entropy becomes 

Sa+b = Sa + Sb, (7) 

which clearly shows extensivity. 

Let us suppose that the galaxy distribution obeys the Renyi statistical me- 
chanics with the grand canonical ensemble. Then the distribution function is 
given by 

PN,E = [l-^A{E-E- f i(N -N)}}^ (8) 

which maximizes the above entropy. 

As is clearly seen from the above expression, the distribution function has 
a long-tail with power-law shape; a significant characteristic which distinguish 
this formalism and the Boltzmann formalism. 

Since the total entropy for N galaxies, Sn, is simply the sum of N entropies 
for a single galaxy: Sn = sN, the Euler relation becomes the ordinary one: 

S= E + » V T -»N (9) 

Using the above expression, the void probability becomes 

f(0) = {l + (l-q)Ns}- 1 . (10) 

Because the change of the parameter s can be renormalized into the change of 
the parameter q, the probability is independent of s. This fact naturally reflects 
the extensiveness of the present entropy. 

As in the previous section, using the count-in-cell method, we compare the 
theoretical model and the observational data of CfAII South. The best fit value 
of the parameter q is q = —0.042. By fixing this value, we further obtain general 
probabilities f(N) and they are compared with the CfA II South data. 

2.4 Tsallis statistical mechanics 

Tsallis C. (1988) proposed the non-extensive entropy 

S[p] = j— — ^ , (11) 
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Figure 5: Same as Figure 1. but in Renyi statistical mechanics. Our best 
fit calculations with the parameter q = —0.042 and CfA II South Observations 
(diamonds with error bars) are plotted. There exist slight disagreement between 
the theoretical model and the observational data. 
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Figure 6: Same as Figure 2. but in Renyi statistical mechanics using parameter 
q = -0.042. 
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where the distribution function Pi is the Escort distribution and is related with 
the bare distribution function pi by: 



w 

3=1 

The Escort distribution is identified to be the physical distribution and is used to 
obtain observable averaging. The distribution function is derived by maximizing 
the above entropy functional with appropriate constraints. 

When we compose two independent systems A and B, and for the distribu- 
tion pij [A, B] = pi [A]pj [B] , composed Tsallis entropy satisfies the non-extensive 
relation: 

S a +b = S A + S B + (I - q)S A S B , (13) 

as is easily calculated by using the above entropy form. 

Let us suppose the galaxy distribution obeys the Tsallis statistical mechanics 
with grand canonical ensemble. As in the case of fractal model and Rcnyi 
statistical mechanics, we do not need the virial parameter b in Tsallis statistics. 

The distribution function 

Pn.e = ^-[l-^-{E-E-fi{N-N)}}^ (14) 

maximizes Tsallis entropy. Here T = CT should be identified as the 'tempera- 
ture' of the system. The factor C appears everywhere in this theory. 

Note that the parameter q measures the extent of deviation from the ordi- 
nary Boltzmann statistical mechanics; the distribution function reduces to the 
Boltzmann distribution forg — > 1. 

Tsallis distribution is identical to the Renyi distribution except the 'temper- 
ature' and the normalization factor. Therefore the Tsallis distribution also has 
the long-tail. 

Since the system is non-extensive, the total entropy of the TV galaxies should 
be calculated by using the composition law of the entropy given above. The 
expression is given by (Nakamichi A., et al., 2001) 

s K = <■ + (■-_«)'>''-■ , (15) 

where the parameter s is identified to be the entropy for a single galaxy. 
Then we obtain the generalized Euler relation a la Tsallis: 



{l+(l-g)S}log[l + (l-g)S] _ E + pV - fiN 
1-q T 



(16) 



As in the previous two sections, using the count-in-cell method, probability 
of void becomes 

/(0) = {1 + (1 - + iVlog{l + (1 - q)s}]r^. (17) 
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Figure 7: Same as Figure 1. but in Tsallis statistical mechanics. Our best fit 
(q = — 5.7, s = 0.16) theory (broken line) and the CfA II South observations 
(diamonds with error bars) are plotted. The fit seems to be the best among the 
four statistical mechanical models. 



This prediction is compared with the observational data of CfAII South. We 
found the best fit parameters q = —5.7 and ,s = 0.16. Almost complete fitting 
is remarkable. 

Further, we can calculate the general probability of finding TV galaxies in a 
given volume V, and compare with the CfA II South data in the figures 7,8. 

We observe that for the higher order probability functions, the prediction 
by the Tsallis statistical mechanics is worse than the previous Renyi statistical 
mechanics, but is better than the Boltzmann statistical mechanics. 

We have used the Escort averaging and not used the normal (old type) 
averaging in Tsallis statistical mechanics because the latter averaging is not a 
consistent theory. Tsallis originally proposed the non-extensive entropy using 
the normal averaging pi (see for example, Tsallis C, Mendes R. S., Plastino A. 
R., 1998) 

S]p] = (E f 9) ~ 1 , (18) 

i - q 

where distribution function pi in normal averaging is determined so as to max- 
imize the entropy functional. 

If we suppose the galaxy distribution obeys the above normal-averaging Tsal- 
lis statistical mechanics with grand canonical ensemble, the distribution function 

Pn,e = i-[l-^-{E-E-tx{N-N)}]^ (19) 

maximizes the Tsallis entropy. Here T = CT is the 'temperature' of the system. 
Also in this case, the distribution function approaches the ordinary Boltzmann 
distribution function for the parameter q approaches to 1. 
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Figure 8: Same as Figure 2. but in Tsallis statistical mechanics with Escote 
averaging, using parameters q = —5.7 and s = 0.16. 

The void probability is obtained in the same manner as the case of Escote 
averaging, but the expression has a slightly different form: 

/(0) = {1 + (1 - q)s}r^[l + JVlog{l + (1 - q)s}]^. (20) 

This time we have found the best fit values q = —1000 and s = 10 200 . Even 
in this best fit case, there exists still significant difference between the theory 
and observational data. The extremely large values above imply the absurdity 
of the theory 

3 Comparison of statistical mechanics using Akaike 
Information Criterion 

In order to calculate f(N), we have studied several kinds of statistical mechan- 
ics, in which the number of free parameters are different. In general, a model 
with much more parameters can fit the observational data better. However 
good physical theories must be simple and include the minimum number of free 
parameters. We need a fair measure to choose the correct statistical model 
to describe the galaxy distributions. One such measure will be the Akaike In- 
formation Criterion (AIC), (Akaike H. 1973; see for example, K. P. Burnham 
and D. R. Anderson 2002), which imposes a penalty on larger number of free 
parameters. 

AIC measure is given by 

AIC = —2 x (maximum log liklihood) 

+ 2 x (number of parameters) (21) 

which is the unbiased estimator for expected mean log-liklihood. The first term 
on the right hand side tends to decrease as more parameters are introduced for 
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better fitting, while the second term then becomes larger. This is the tradeoff 
between the number of parameters and the variance. Better model has smaller 
AIC measure. 

Here the "parameters" denotes the free parameters to be estimated by the 
maximum likelihood analysis. For the present argument, they are the variance 
estimated by the maximum likelihood analysis, and the other free parameters 
in each theoretical model. 

Here the "maximum log liklihood" is calculated as follows. When we com- 
pare observational data sets {xi, Vi} 1<i<N and the theoretical model y = 
P(x), we assume that the errors (distance between the data and the theo- 
retical model) distribute according to the Normal distribution: / (xi, yi) = 
, 1 „ e~ Uz( yi ~ p ( Xi ^ . Then the log-likelihood function becomes 

JV 

1(9) ee ]Tln f(x h y t ) (22) 

i=l 

= -^In^TraV^E^-^)) 2 ' 

where 9 represents all the parameters in the model. The maximum likelihood 
estimate equations are 

^ = 0, (23) 



09 

ae (6>) 

da 2 2o 2 ' 2a 

i—1 



N 1 A , , s ,o 



The first equation determines the parameters 9. The second equation determines 
the variance a 2 of the error to be estimated value a 2 as 



a 2 



1 N 

-^(Vi-Pixi)) 2 . (24) 



i=\ 



Then the maximum log-likelihood function becomes I (jfj = — y — y \h2t:& 2 . 
Therefore AIC measure reduces to 

AIC = (#data points) x (1 + log 10 2ir (variance)) 

+ 2 x (1 + number of free parameters) (25) 

We have compared the AIC values for the probability of finding N galaxies 
in each theoretical model. In the figure 9, such AIC values are plotted. 

For the void probability (JV = 0), we observe that the Tsallis statistical me- 
chanics with the Escote averaging has a remarkably small AIC. On the other 
hand for higher order probabilities ((1 < JV)), the Renyi statistical mechanics 
has much smaller AIC than the Tsallis with the Escote averaging. Therefore 
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Figure 9: AIC measure for probability of finding N galaxies in each theoretical 
model for f(N), (0 < N < 9). Vertical axis denotes the value of AIC. The 
smaller AIC, the theoretical model is better. Horizontal axis denotes the number 
of galaxies N of / (N) . AIC of Boltzmann statistical mechanics is plotted by 
crosses, fractal space model in Boltzmann statistical mechanics is plotted by 
diamonds, Renyi statistical mechanics is plotted by squares, Tsallis statistical 
mechanics with the Escote averaging is plotted by triangles, and the old type 
Tsallis statistical mechanics is plotted by stars. 
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only from these data, it is not possible to select the correct model among Tsal- 
lis and Renyi statistical mechanics. Therefore we cannot determin the (non)- 
extensivity of SGS. However, it is clear that the other models have large AIC 
values for all the probabilities. Therefore we may conclude that the long tail 
of the distribution function would be essential for describing the distribution of 
galaxies. 

4 Discussions 

We have studied two specific properties of SGS, non-extensivity of the entropy 
S, and the long-tails in the distribution functions, by explicitly introducing sev- 
eral models of statistical mechanics. By using the data of CfA II South redshift 
survey and the count-in-cell method, we have studied four statistical mechan- 
ics: (1) Boltzmann, (2) Fractal, (3) Renyi, and (4) Tsallis. These statistical 
mechanics have been evaluated by the Akaike information criteria (AIC) for the 
fair comparison. 

In our study, it has been essential that the expression of the probabilities 
f(N) depends not only the distribution function but also the explicit form of 
the entropy. The long-tail property of distribution function affects the former 
dependence and the (non-)extensivity of entropy affects the latter dependence. 
Therefore we could distinguish the Renyi and Tsallis theories despite that they 
have almost the same form of distribution functions Pn.e (see for example, 
Arimitsu T. & Arimitsu N. preprint 2002 ). 

First we have seen that in Boltzmann statistical mechanics, it is unable 
to fit the CfAII South data of galaxy distribution even if we introduce the 
free parameter b which measures the deviation from the complete dynamical- 
equilibrium, or the free parameter a which measures the fractal property of the 
space. 

Then we have investigated two statistical mechanics which have long tail 
distributions. Both the Renyi extensive statistical mechanics and the Tsallis 
non-extensive statistical mechanics are far better than the above two models 
based on Boltzmann statistical mechanics. Therefore the long-tail in the distri- 
bution function will be essential for the correct statistical mechanics to describe 
SGS. 

On the other hand for the (non-) extensivity, we cannot have clear conclu- 
sions. This is because the Tsallis non-extensive theory can fit /(0) better than 
Renyi extensive theory, and the latter theory can fit f(N) (N > 1 ) better than 
the former theory. This incomplete conclusion may reflect that the size of the 
CfA survey data may still be small and much comprehensive survey would derive 
the complete answer to our question. We believe our method can in principle 
select the correct theory of statistical mechanics to describe SGS out of many 
candidate theories. 

Acknowledgement One of the authors (MM) would like to thank Prof. Hiroshi 
Hasegawa for very useful discussions and suggestions. 
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A Probabilities P (N) and the generating func- 
tional 



When we calculate the provability P (N) of finding N galaxies in a given volume 
V, we use the expression 

(-n) N d N 

P(N) = ^^P(0) (26) 

where n is the number density of galaxies. (White S. D. M. 1979) 

The proof of this expression is originally given by S. White in the above 

reference with cumbersome arguments. In this Appendix, we extract the essence 

of the proof and try to clarify the logic as much as possible. 

For a general statistical variable <j> (x) , which is a field on three-dimensional 

space, the partition function is given by 



Z [J] = / exp 



- / J (x) <j> (x) d 3 x 



v 



(27) 



exp ( — J (x) 4> {x) d x 
expW [J] 



v 



where the brackets represent the functional integral of the field 4> ( x ) ( or the 
sum over possible functions <j) (x)), and J (x) is a source field. The whole part of 
the cumulants or the connected correlation functions, denoted as the brackets 
with a suffix c, are defined by the above equation. 

Wc now specify the field </> (x) as (discontinuous, non-averaged) number den- 
sity of galaxies: 



oo 



i=l 

and accordingly the functional integration is reduced to the following form 

y ' Jv V V V K ' 

since a distribution of galaxies at {xi, X2,X3, ...x n , ■■■} determines a field <f> (x). 
It is obvious that the probability of finding one galaxy within a small vol- 
ume d 3 x\ around the space position x\ is P{x\)d?x\ = (<f> {x\)) d 3 xi, and the 
joint probability of finding one galaxy within a small volume d?x\ around the 
space position x\ and the other within d 3 X2 around X2 is P (x\, x^) d 3 xid 3 X2 = 
(xi) (j> (X2)) d 3 X\d 3 X2, and so on. Much more useful quantities are the prob- 
ability of finding no galaxy P (0) within a fixed volume V. Suppose the volume 
V is divided into small pieces {v\, ...,%}, each of which is of order v = V/M. 
Then this void probability is 

P(0) = (l[ (l-<f>(x m )v m )\ (30) 
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M 



exp 



O (Mv 2 



Taking the limit M — > oo with fixed V, and therefore O (Mv 2 ) = O (V 2 /M), 
the void probability simply becomes 



(31) 



= \ exp 



- / 0(x)d J x 



= Z[1] 



Similarly, the probability of finding one galaxy within a small volume d 3 x\ 
around the space position x\ and finding no other galaxies is given by 



P(xi ; 0)d 3 xi 



M 



lim ( (j) (xi) Ui TT (1 - (j) fam) »ro) 
M—>oo \ 



m=2 



lim ( (/) (xi) exp 



M 



4>{x m )v„ 



O (Mv 2 



= (</>(xi)exp 



— I 4> (x) d x 



d x\ 



General probability for N galaxies is given by 

P (xi, xn ; 0) d 3 x\...d 3 XN 
I 4> (x\) ...tfi (xn) d 3 x\...d 3 XN'x 

= ( n (i-<Hx)<px) 

\ x&V— {xi,x 2 ,--,xm} 



(f>{xi) ...4> (x N ) exp 



4> (x) d 3 x 



d x-y...d xn 



On the other hand, the partition function can be expanded as 



Z[J] = 



°° (-1)' f (^(Xl)^^) 



1=0 



y (-1)' 



x J (xi) J(x 2 ) , ... J (x z ) 

P [xix 2l -,xi) 

x J(xi) J{x 2 ) , -J{xi) 
Y.dV x dV 2 ...dVi 



In general, 



P(xi, ...,Xtv) 



(<f)(x 1 )...(f)(x N )} 



<9J (xi) ...<9J (xjv) 



./— 



(32) 



(33) 



(34) 



(35) 
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and 



P(xi, ...x N ; 0) 



4>(xi) ...<f>{x N )e fv' 
d N Z [J] 



(36) 



dJ (xi) ...dJ (xjv) 



The probability of finding exactly N galaxies with the volume V, using 
P {x\X2, ...xn ; 0) but not P (x±X2, ■■■xn), is given by 



P(N) 



1 

ivT 
(-i) 



P(xix 2 , -x N ; 0) 



v N 



(37) 



AH 



d N z [j] 



Vn dJ(xi) ...dJ{x N ) 



j — i 



If we factor out the mean number density n from the density field <j) (x) as 
<f> (x) = n (1 + 8 (a;)), where 6 (x) denotes deviation from the average, then we 
have 



dZ [J] 



and similarly 



dJ(x) 

d N z[j] 



dZ[l] 
dn 



Vi Jv N dJ{xi) ■ ■ -dJ(x N ) 



N 9 N Z[1] 
dn N 



Using this form and Eq. (|31|l for Eq. (|37|l . we finally have 



P(N) 



(-n)" d 
N\ dn N 



P(0) 



(38) 



(39) 



(40) 



as desired. 
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